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IDENTmCATION OF INPUT FILES USING REFERENCE FILES 
ASSOCIATED WITH NODES OF A SPARSE BINARY TREE 

CROSS REFERENCE TO RELATED APPLICATIONS 

[0001] This ^plication claims priority to United States Provisional 
Application Serial No. 60/568,881 filed May 6, 2004, which is incorporated 
herein by reference; and is a continuation-in-part of commonly-ovmed United 
States Application Serial No. 10/472,458, filed September 19, 2003, entitled 
**Method and Apparatus for Identifying Electronic Files," which is also 
incorporated herein by reference. 

FIELD OF T WR TNVRNTTON 

[0002] The present invention generally relates to techniques for 
identifying digitized samples of time varying signals and in particular, to a 
method and apparatus for identifying input files using reference files associated 
with nodes of a sparse binary tree. 

BACKGROUND OF THE INVENTION 

[0003] In searching for particular audio files on the Memet, it is useful to 
be able to determine the identity of untitled audio files as well as to confirm that 
titled audio files are what they purport to be. Although a human may conceivably 
make such detemiinations and confirmations by simply listening to the content of 
the audio files by playing them ttm>ugh a media player, such an approach is not 
always reliable. Also, a process such as this involving human judgment is 
inherently very slow. 

[0004] Therefore, it is advantageous to employ a computer to determine 
the identity of untitled audio files as well as to confirm that titled audio files are 
what they purport to be. The computer can not only store a lot of information to 
assist in identifying an ioput audio file, it can also process that information very 
quickly. 
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[0005] In one technique employing a computer, an algorithm is used to 
uniquely identify audio file content Using this approach, a master code is 
geuCTated by performing the algorithm on content in a master audio file. By 
^plying the same algorithm to the content of an input audio file, the calculated 
code may then be compared ^th the master code to determine a match. 

[0006] Use of such an algorithm, however, does not always lead to proper 
identification, because ttie content of an audio file may not have exactly the same 
length of recording as the content of the master audio file, for example, by 
starting at a point a little later in time, thus giving rise to a calculated code that 
would not match the master code. Also, if the content of flie input audio file 
contains noise spikes or background noise, this would also give rise to a 
calculated code that would not match the master code. . Thus, in both of these 
cases, file stored content is not properly identified. 

OBJECTS AND SUMMARY OF THE INVENTION 

[0007] Accordingly, one object of the present invention is to provide a 
method and q>paratus for identifying input files that are reliable even if their 
content is of&et in time, or contains noise spikes or background noise. 
[0008] Another object is to provide a method and apparatus for 
identifying input files that are computationally &5t when performed in a 
computer system. 

[0009] Another object is to provide a method and apparatus for 
identifying input files that minimize data storage requirements in a computer 
system. 

[0010] These and other objects are acconq)lished by the various aspects of 
the present invention, wherein briefly stated, one aspect is a method for matching 
an input audio file with reference audio files, comprising: identifying potential 
matches of an input audio file among reference audio files based upon at leaist one 
common characteristic; and searching for a match of the ii^ut audio file among 
the potential matdies. 

2 
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[0011] Another aspect is a method for matching an input audio file with 
reference audio files, comprising: identifying potential matches of an input audio 
file among reference audio files based upon at least one common characteristic; 
and conaqparing an input profile resulting firom a measurable attribute of the input 
audio file with reference profiles resulting from the same measurable attribute of 
the potential matches to detemiine a match. 

[0012] Another aspect is a method for matching an iiq)ut file with 
reference files, comprising: identifying potential matches of an inpvX file among 
reference files by associating nodes of a sparse binary tree with the input file in a 
same manner used to associate nodes of the sparse binary tree with the reference 
files; and searching for a match of the iiq)ut file among the potential noiatches.. 
[0013] Another aspect is a method for matching an input file with 
reference files, comprising: identifying potential matches of an input file among 
reference files by associating nodes of a sparse binary tree with &e input file in a 
same manner used to associate nodes of the sparse binary tree with the reference 
files; and compariag a profile resulting fix)m a measxirable attribute of the input, 
file with profiles resulting firom the same measurable attribute of the potential 
matches to determine a match. 

[0014] Another aspect is a method for matching an input audio file with 
reference audio files, comprising: generating an irxpnt profile fi'om an ii^ut audio 
file based upon a measurable attribute also used to generate reference profiles 
fix>m reference audio files; identifying potential matches among the reference 
profiles with the input profile by processing the input profile in a manner used to 
associate individual of the reference profiles wifli nodes of a sparse binary tree; 
and comparing the input profile wi& the potential matches to determine a match. 

[0015] Still another aspect is a method for matching an ii^ut audio file 
with reference audio files, comprising: generating reference profiles fiom 
reference audio files using a measurable attribute; generating a sparse binary tree 
by qsplying a process to the reference profiles such that identifications of the 
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reference profiles are associated at resulting nodes of the sparse binary tree; 
generating an input profile bom the iiq)Ut audio file using the measurable 
attribute; applying the process to the input profile so ttiat associated reference 
profiles are identified &om resulting nodes of the sparse binary tree; and 
coii5)aring at least a portion of the iiq)ut profile with corresponding portions of 
the identified reference profiles to determine a match. 
[0016] Another aspect is an apparatus for matching an input audio file 
with reference audio files, comprising at least one conoputer configured to: 
identify potential matches of an input audio file among reference audio files 
based upon at least one conmion characteristic; and search for a match offht 
input audio file among the potential matches. 

[0017] Another aspect is an apparatus for matching an input audio file 
with reference audio files, comprising at least one computer configured to: 
identify potential matches of an input audio file among reference audio files 
based upon at least one common characteristic; and compare an input profile 
resulting &om a measurable attribute of the input audio file with reference 
profiles resulting fix)m the same measurable attribute of the potential matches to 
determine a match. 

[0018] Anoflier aspect is an apparatus for matching an input file with 
reference files, con^rising at least one computer configured to: identify potential 
matches of an inpvit file among refwence files by associating nodes of a sparse 
binary tree with the input file in a same manner used to associate nodes of the 
sparse binary tree with the reference files; and search for a match of the input file 
among the potential matches. 

[0019] Another aspect is an apparatus for matching an input file with 
reference files, comprising at least one coniputer configured to: identify potential 
matches of an mput file among reference files by associating nodes of a sparse 
binary tree witii the iiq)ut file in a same maimer used to associate nodes of the 
sparse binary tree with the reference files; and con^are a profile resulting fixmi a 
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measurable attribute of the input file with profiles resulting firom the same 
measurable attribute of the potential matches to determine a match. 
[0020] Another aspect is an 25)paratus for matching an input audio file 
with reference audio files, comprising at least one computer configured to: 
generate an input profile fi-om an input audio file based ttpon a measurable 
attribute also used to generate reference profiles firom reference audio files; 
identify potential matches among the reference profiles with the hsput profile by 
piocessmg the input profile in a manner used to associate individual of the 
reference profiles with nodes of a sparse binary tree; and conq)are the input 
profile with the potential matches to determine a match. 
[0021] Yet another aspect is an apparatus for matching an input audio file 
with reference audio files, comprising at least one computer configured to: 
generate reference profiles firom reference audio files using a measurable 
attribute; generate a sparse binary tree by applying a process to the reference 
profiles such that identifications of the reference profiles are associated at 
resulting nodes of the sparse binary tree; generate an input profile firom the input 
audio file using the measurable attribute; apply the process to the input profile so 
that associated reference profiles are identified firom resulting nodes of the sparse 
binary tree; and compare at least a portion of the iiq>ut profile with corresponding 
portions of the identified reference profiles to detennine a match. 
[0022] Additional objects, features and advantages of the various aspects 
of the present invention will become parent finom flie following description of 
its preferred anbodunent, which description should be taken in conjunction witii 
the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0023] FIG. 1 illustrates a data flow diagram for an audio matcher 
program, utilizing aspects of the present invention. 
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[0024] HG. 2 illustrates a flow diagram of a method for generating 
reference profiles and a reference proffles tree, utilizing aspects of the present 
invention. 

[00251 PIG. 3 illustrates a flow diagram of a mediod for generating 
profiles j&om digitized audio clips, utilizing aspects of the present invention. 

[0026] FIG. 4 illustrates a flow diagram of amethod for generating a 
refCTence profiles tree, utilizing aspects of the present invention. 

[0027] FIG. 5 illustrates a flow diagram of a method for associating 
reference profile infomiation with nodes of a reference profiles tree, utilizing 
aspects of the present invention. 

[0028] FIG. 6 illustrates a flow diagram of a method for storing a profile 
hoolc into a reference profiles tree for each chunk offset of a reference profile 
identified as being a local maximum, utilizing aspects of the present invention. 
[0029] FIG. 7 illustrates a flow diagram of a method for generating an 
input profile &om an input audio clip. 

[0030] FIG. 8 illustrates a diagram for fimctions performed by an audio 
matcher, utilizing aspects of the present invention. 

[0031] FIG. 9 illustrates a flow diagram of amethod for sanq)ling input 
profile chunks and determining chunk oflfeets to be used for searching a reference 
profiles tree for profile hooks, utilizing aspects of the present invention. 

[0032] FIG. 10 illustrates a flow diagram of amethod for searching a 
reference profiles tree for profile hooks coiiesponding to an input profile chunk 
offset, utilizing aspects of the present invention. 

[0033] FIG. 11 illustrates a flow diagram of amethod for conrparing an 
input profile against reference profiles corresponding to profile hooks retrieved 
&om a reference profiles tree search, utilizing aspects of the present inventioiL 

[0034] FIG- 12 illustrates a flow diagram of amethod for merging mini- 
matches, utilizing aspects of the present invention. 
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[0035] FIG. 13 illustrates a diagram for sub-functions perfonned by an 
audio matcher, utilizing aspects of the present invention. 
[0036] FIG. 14 illustrates a flow diagram of amethod for identifying an 
mput file as a spoof file, utilizing aq)ects of flie present invention. 
[0037] FIG. 15 illustrates a flow diagram of a method for identij^g a 
best match for an wpnt file, utilizing aspects of the present invention, 

nFT An .KD DESCRIPTION OF THE PREFERRED EMBODIMENT 

[0038] All methods, generators and programs described herein are 
preferably performed on one or more computers cooperating together such as in a 
distributed or other processing envirormient 

[0039] Referring to FIG. 1, an audio matcher program 100 matches an 
input profile 101 of an input audio clip to one of a store of reference profiles 102 
of reference audio clips. In addition to tihie input profile 101 and the reference 
profiles 102, a reference profiles tree 103 is also used by the audio matcher 100 to 
facilitate and expedite the matching process, which in this case, results in one of 
the following outcomes: a determination fliat the input profile 101 is a spoof (for 
example, its corresponding input audio clip is not what it purports to be), or an 
identification of an acceptable best match for ttie input profile 101 among the 
reference profiles 102, or a determination that no acceptable match has been 
found for the input profile 101 among the reference profiles 102. 
[0040] The reference audio clips in this case may be published music that 
is protected by copyright law, and the input audio clips may be audio files either 
residing on user computers or being transmitted through the Internet using a file 
sharing network. Formats for the audio clq)S may be any standard format such as 
1VIP3. 

[0041] FIG. 2 illustrates, as an example, a method by which the reference 
profiles and dereference profiles tree are generated. Li particular, a profile 
generator 202 generates the referOTce profiOies 102 fix>m corresponding reference 
audio clips 201. A reference profiles tree generator 203 then generates a 
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references profiles tree 103 from the reference profiles 102. The format of the 
reference profiles tree 103 is a sparse binary tree in order to minimize data 
storage requirements and maximize search performance. 
[0042] FIG. 3 illustrates, as an example, a method performed by the 
profile generator 202 to generate profiles fiom di^tized audio clips. In the 
mefliod, a chunk represents a programmable period of time such as 0.1 seconds of 
the audio clip. In 301, digitized san[q)le infi3rmalion for a first chunk of tibte audio 
clip is serially read, and in 302, the number of zero crossmgs in the chunk is 
counted. A zero crossing occurs each time the sign changes between adjacent 
samples. In 303, a determination is made whetiier flie current chunk is a last 
chunk in the profile. If the determination results in a YES, then the profile 
generator 202 terminates. On the other hand, if the determination is NO, then in ' 
304, digitized sample information for a next chunk of the audio clip is serially 
read, and the process continues by jumping back to 302 and repeating 302'-304 
until the last chunk in the profile has been processed through 302. 
[0043] The profile generator 202 is used to generate reference profiles 
102 fiom reference audio clips 201 (as shown in FIG. 2) and to generate an mput 
profile 101 fiom an input audio clip 701 (as shown in FIG. 7). In addition to 
counting zero crossings as described in reference to FIG. 3, the profile generator 
202 may also generate other information such as amplitude ratios between 
successive chunks to better characterize audio clips in its generated profiles and 
improve matching accuracy by the audio matcher 100 through the use of such 
enhanced profiles. 

[0044] FIG. 4 illustrates, as an eKznxple, a method performed by the 
referrace profiles tree generator 203 to generate a reference profiles tree 103 by 
including reference profile information for each of the reference profiles 102 in it 
As previously desmbed, flie reference profiles tree 103 is generated as a sparse 
binary tree. 
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[0045] FIG. 5 illustrates, as an example, a method performed in function 
402 of FIG. 4 to add reference profile information to the reference profiles tree 
103. As noted in FIG. 4, this metiiod will be performed for each of the reference 
profiles 102, so that information of all of the reference profiles will be stored m 
the same reforence profiles tree 103. The reference profiles tree 103 and 
particularly, its sparse binary tree architecture, will subsequently be used by the 
audio matcher to significantly speed up the matching process. 

[0046] As used herein, tixe term "chunk ofi&ef ' means ihc difference in 
number of chunks between a current diunk of the refiarence profile and a first 
chunk of the reference profile, plus one. Thus, the number of tiie chunk is equal 
to the chunk offset in this convention. 

[0047] Two programmable parameters are used in the method. The tenn 
^Velocity*' means the number of chunks between local maximums in the reference 
profile, and the terai "acceleration" means the change in velocity divided by the 
number of chunks over which the change occurs. Initial values for velocity and 
acceleration are pre-defined prior to perforaiance of the fimction 402. As an 
example, the initial velocity may be set to 1, and the initial acceleration may also 
be set to 1. The velocity is then modified according to the method. The 
acceleration, on the other hand, is generally constant at its initial value. 
[0048] In 501, the chunk ofEset is initialized to be equal to the initial 
velocity. la 502, a determination is made whether the zero crossing count for the 
current chunk is a local maximum. To be considered a local maximum, the zero 
crossing count for the current chunk must be greater by a programmed threshold 
value than both the zero crossing count for the chunk right before the current 
chunk and the zero crossing count for the chunk right after &ecimrentchun^ Jn 
situations where the current chunk does not have eitiier a chunk rig^t before it 
(i.e., it is the first chunk in flie reference profile) or a chunk ri^t after it (i.e., it is 
the last chunk in the reference profile), a zero will be assumed for the zero 
crossing count in those cases. 



9 



wo 2005/109179 



PCT/US2005/015643 



[0049] If the detennination in 502 is YES, then in 503, a profile hook for 
this chunk ofiset is stored in the reference profiles tree 103. Additional details on 
503 are described in reference to FIG. 6 below. 

[0050] On the other hand, if the determination in 502 is NO, then in 504, 
the chunk ofifeet is mcremented by the velocity. 
[0051] In 505, a determination is then made whettier the end of the 
reference file has been reached. This determination would be YES, if the new 
chunk of&et is greater than the chunk number of the last chunk in the reference 
profile. Therefore, if the detennination in 505 is YES, then the method is done, 
and anoflier reference profile can be processed as shown in FIG. 4. 
[0052] On the other hand, if the determination in 505 is NO, then in 506, 
the velocity is incremented by the acceleration. By incrementing the velocity in 
this fashion, chunks will be processed in a more efficient manner. Rather then 
processing every chunk in a reference profile to see if it is a local maximum, 
chimks are processed in a quadratically increasing fashion to take advantage of 
the observation that matches between input profiles and reference profiles usually 
can be determined early on in the profiles. 

[0053] The method then loops back to 502 to process the newly calculated 
chunk offset, and continues looping through 502^506 until the end of the 
reference profile is determined in 505: 

[0054] FIG. 6 illustrates, as an example, a method performed in fimction 
503 of FIG. 5 to store a profile hook into the reference profiles tree 103 for each 
chunk ofi&et identified in 502 of FIG. 5 as having a local maximum zero crossing 
count In 601 and 602 respectively, the current node at whidi processing starts 
on the sparse binary tree is set to its root node and the chunk at which processing 
starts is set to fhe chunk ofi&et being processed at the time. 

[0055] In 603, a determination is made whether the zero crossing count 
for the cunent chunk is greater than a programmable constant or threshold value. 
If the detemiination in 603 is NO, then in 604, the current node is changed to a 
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right-branch child node, which is created at that time if it doesn't akeady exist in 
the reference profiles tree 103. On the other hand, if the determination in 603 is 
YES, then in 60S, the current node is changed to a left-branch child node, which 
is created at that time if it doesn't already exist in the reference profiles tree 103. 
[0056] In 606, a determination is then made wheth^ the current chuiik is 
the last chunk in tiie reference profile. If the detennination in 606 is NO, then in 
607, the current chunk is incremented by 1, and ttie method loops back to 603, 
and continues looping through 603^607 until the determination in 606 is YES. 
When the detemunation in 606 is YES, then in 608, the method stores the profile 
hook in the then current node, and is done, the profile hook in this case includes 
a profile identification or "ID" and the chunk of&et that is being processed at the 
time in fimction 503. The profile ID serves to uniquely identify the content of the 
reference profile in ttiis case. 

[0057] In the following description, it is now assumed that generation of 
the reference profiles tree 103 is complete so that it contains information of 
profile hooks for each of the reference profiles 102 at various of its nodes. 

[0058] FIG. 7 illustrates, as an example, a method for generating an input 
profile 101 that parallels the method used for generating each of the reference 
profiles 102. 

[0059] FIG. 8 illustrates three primary functions performed by tihe audio 
matcher 100. In a first function 801, the audio matcher 100 identifies mini- 
matches of an iiq>ut profile 101 in the reference profiles tree 103. If no mini- 
matches are found, then this function reports back that the input profile is a no- 
match. Since the no-match is detemoined early on in the process, this avoids the 
necessity to perfonn subsequent processing to determine a best match or that the 
input profile is for a ^oof. 

[0060] Assuming mini-matches have been identified between the input 
profile and one or more reference profiles, then in a second function 802, the 
audio matcher 100 then stores and merges when appropriate the mini-matches for 
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subsequent processing. In a third function 803, the audio matcher 100 then 
determines one of the following: an acceptable best match for the input profile; a 
determination that the input profile is a spoof; or a no-match if the input profile is 
not detemuned to be a spoof or if an acceptable best match cannot be found. 
[0061] FIG. 9 illustrates, as an example, a method for performing the 
mini-match identifying function 801 of FIG. 8. The method parallels that of 
FIG. 5, wherein local TngnHTwiimg in a reference profile are identified so that 
profile hooks can be stored in the ref(^:ence profiles tree 103. In this case, 
however, local maximums in the input profile are identified so that mini-matches 
may be found in the reference profiles tree 103. In particular, the identification of 
zero crossing count local maviTnuins in the ii^>ut profile as performed in 901, 902 
and 904-906 are performed identically as their counterparts 501, 502 and 
504^06 of FIG. 5 in identifying zero crossing count local maximimis in a 
reference profile. 

[0062] In 903, however, rather than storing a profile hook in the reference 
profiles tree for the chunk oSset as performed in 503 of FIG. 5, the chunk offset 
is used to search for matches in the reference profiles tree 103. 

[0063] FIG. 10 illustrates, as an example, a method performed in function 
903 of FIG. 9. The method is similar to that of FIG. 6, wherein movement down 
the reference profiles tree 103 is performed. In this case, however, zero crossing 
counts in the input profile are used instead of the zero crossing counts in a 
reference profile to determine the movement down the reference profiles tree 103. 

[0064] Startmg in 1001, the current node in fixe refermce profiles tree 103 
is initially set to the root node, and in 1002, the current chunk is set to the chunk 
of&et currently being processed. 

[0065] In 1 003, a determination is made whether the zero crossing count 
for the current chunk is greater than a programmable constant Ihe constant that 
is to be used here is fiie same as that used in 603 of FIG. 6. 
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[0066] If the determination in 1 003 is NO, then in 1 004, the current node 
is changed to a right-branch child node. On the other hand, if the determination 
in 1003 is YES, then in 1005, the current node is changed to a left-branch child 
node. 

[0067] In 1006, a determination is then made whether the current chunk is 
the last chunlc in the input profile. If the determination in 1006 is NO, then in 
1007, the current chunk is incremented by 1, and the method loops back to 1003, 
and continues looping trough 1003-1007 until the determination in 1006 is 
YES. When the determination in 1006 is YES, then in 1008, the method matches 
the input profile against all reference profiles identified in profile hoolcs stored at 
the current node of the reference profiles tree 103. 

[0068] BIG. 11 illustrates, as an example, a method for performing the 
fimction 1008 of FIG. 10. In 1101, a determination is first made whether there 
are any reference profiles identified in reference profile hooks stored at the 
current node of the reference profiles tree 103. If the determination in 1101 is 
NO, then the method has nothing more to do so it ends. 

[0069] On the other hand, if the determination in 1101 is YES, then in 
1102, the first N chunks of the input profile are compared with the corresponding 
first N chunks of a first reference profile identified In 1 103, a determination is 
made whether they match. Li order for corresponding chunks to match, their zero 
crossing counts do not have to be exactly equal. As long as the absolute 
' difference between the zero crossing counts is within a programmed tolerance, 
they may be determined to be a match. Also, it may not be necessary for all of 
the first N chunks to match, the match determinationmay be a YES as long as a 
higji enougfh percentage of tiie first N chunks match. 

[0070] If the determination in 1103 is a YES, then in 1104, a mini-match 
at the current ofi&etof&e input profile is generated. Generation of flie mini- 
match involves including the information in the following table in the mini- 
match. 
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Table I. Infonnation included in a mini-match. 



Parameter 


Description 


timei 


The offeet into the input profile at which this match begins. 


time2 


The offset into the input profile at which this match ends. 


Wti 


The ofiEset into the input profile at which the reference profile 
begins. 


Wt2 


The ofiset into the input profile at which the reference profile 
ends. 


time matched 


The amount of match time represented by this mini-match. 


full match 


•True" if this is a •*fuir match. 


source 


The ID for the reference profile corresponding to this mini-match. 


err 


The total amount of error in this mini-match. 


ID 


An identifi^ for the mini-match. 



[0071] On the other hand, if the determmation in 1103 is a NO, thra in 
1105, a determination is made whether there is another reference profile 
identified at the current node of the reference profiles tree 103. If the 
determination in 1105 is YES, then in 1106, the first N chunks of the input profile 
are then compared with those of the next identified reference profile, and the 
method continues by looping through 1103-1106 until either a match is found or 
there are no more reference profiles to be compared against the input profile. 
[ 0072 ] If the determination in 1105 results at any time in a NO, then in 
1107, the method generates a ''non-fiiU" mini-match using the best matching one 
of the reference profiles identified at the current node of the reference profiles 
tree 103 (i.e., the reference profile whose first N chunks came closest to being 
determined as a inatch to flie first N chunks of tiie input profile). Aswitfathe 
**fiill" mini-malch generated in 1104, the "non-fiill" mini-match will also be 
associated to flie cuzient of&et of the input profile. 

[0073] HG. 12 illustrates, as anexanople, amethod for performing the 
mini-match storing and merger function 802 of FIG. 8. In 1201, a first one of the 
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mini-matches generated in the function 801 of FIG. 8 is input. The mini-match 
can be either a "full" or "non-fiill" mini-match. In 1202, a determination is made 
whether any mini-matches have already been stored in the audio matcher 100 for 
subsequent processing. If the determination in 1202 is NO (as it will be for the 
first mini-match being input for the input profile), fhssa in 1203, the mini-match is 
added to flie store and the method juiiq>s down to 1207. 
[0074] hi 1207, a determination is then made whether there are any more 
nuni-matches to be iiq>ut. If tiie detomination in 1207 is YES, fhm the me&od 
jumps back to 1201 to hxpui the next mini-match. In 1202, a determination is 
once again made whether there are any stored mini-matches. This time, since the 
Gxst niiiii-match was stored, the determination will result m a YES , so that the 
method proceeds to 1204. 

[0075] In 1204, a search is performed to find a merger candidate for the: 
current mini-match among the mini-matches aheady in the store. In order to be 
considered a merger candidate, the current mini-match and the stored mini-match 
must refer to the same reference profile ID, and any difference between their 
respective wti parameters (offsets into the input profile at which the reference 
profile begins) must be within a specified tolerance such as SO chunks or 5 
seconds. 

[0076] In 1205, a determination is then made whether a merger candidate 
has been found. If the determination in 1205 is NO, then the current mini-match 
is added to the store in 1203, and the method proceeds fix>m there as previously 
described. . 

[0077] On the other hand, if the determination in 1205 is YES, then in 
1206, the current mini-match is merged with the merger candidate. When 
merging the current mini-match with the merger candidate, &e parameter values 
for wti, wt2, timei and tim^ of the merged mini-match are wei^ted averages of 
the current mini-match and the merger candidate values, weigjh^ted by their 
respective matched times. The parameter value for '*err" of the merged mini- 
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match is the sum of the current mini-match and the merger candidate values. If 
either the currmt mini-match or the merger candidate is a "full" match, then the 
merged mini-match has its full match parameter set to true. 
[0078] After merger, the method proceeds to 1207. 
[0079] In 1207, a determination is made whether there are anymore mini- 
matches to be processed. If the detennination in 1207 is YES, then the method 
proceeds by looping through 1201^^1207 until all mini-ntiatches have been 
processed by either being stored individually in the audio matcher store or 
merged with another mini-match already stored in the audio matcher store, and 
the determination in 1207 at that time results in a NO. 
[0080] FIG. 13 illustrates, as an example, a software structure for 
implementing the function 803 of FIG. 8 in which a first function 1301 
determines if the input profile is a spoof, and a second function 1302 finds a best 
match firom the store of mini-matohes generated in 802 of FIG. 8. The first and 
second functions 1301 and 1302 may be performed serially, or in parallel as 
shown. If both functions 1301 and 1302 feil (i.e., the first function 1301 fails to 
identify the input profile 101 as a spoof and the second function 1302 fails to find 
an acceptable best match), then in 1303, it is determined that ttie input profile 101 
is a no-match (i.e., no match has been foimd for it among the reference profiles 
102). 

[0081] FIG. 14 illustrates, as an example, a method for performing the 
first function 1301 to detennine whether the input profile is a spoof In 1401, a 
determination is made whether there is more than one reference profile identified 
by the mini-matches in the store. If flie detennination in 1401 results in a NO, 
then in 1402, a no spoof found conclusion is made and the method stops at that 
point 

[0082] On the oth^ hand, if the determination in 1401 is YES, then in 
1 403, a detennination is made whether the sum of the time matched for all the 
mini-matches in the store is greater fhm some ftreshold percentage of the iiq>ut 
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profile such as, for example, 70%. If the determination in 1403 results in a NO, 
then in 1402, a no spoof found conclusion is made and the method stops at that 
point 

[0083] On the other hand, if the determination in 1403 is YES, then in 

1404, a determination is made whether each mini-match has an error/second 
value that is less than some maximum value. The error/second value for each 
mini-match may be calculated by the ratio of the mini-match's '^eix^' parameter 
and *time matched" parameter. If the detemoination in 1404 results in a NO, then 
' * ' ^'^^ " * * lusion is made and the method stops at that point 
L0084J Onthec nand,ifthedetenninationinl404is YES,theiiin 

1405, a spoof found c^-^^^ ?s made and the method stops at that point In 
this case, the sooof may be lu^^. y compositing several tracks together or 

' . of one track. Since fliese kinds of spoo& are quite 

common on peer-to-peer networks, the ability to automatically identify them is 
useful. 

[0085] flG. 15 illustrates, as an example, a method for performing the 
second function 1302 to find an acceptable best match for the input profile 101. 
In 1501 and 1502, the method starts by ignoring all ^'non-fiiU" mini-matches and 
all mini-matches having an errors/second greater than a maximum allowable 
value. 

[0086] In 1503, the method then identifies one of the remaining mini- 
matches as a best match according to programmed criteria such as its 
errors/second value, its time matched value, and the percentage of its reference 
profile that it recognizes. Typically, fhe best match will be a mini-match that 
exceeds all other mini-matches in all of these criteria. In the event that two mini- 
matches are close, some weighting of the criteria may be performed to determine 
a best match between the two. 

[0087] In 1504, a determination is then made whether the percentage of 
the iiQ>ut profile and the reference profile covered by the best match exceeds 
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some minimum value. If the deteraaination in 1504 is YES, then in 1505, the best 
match identijBed in 1503 is concluded to be an acceptable best match and the 
method ends at that point On the other hand, if the detennination in 1504 is NO, 
then the best match identified in 1503 is concluded in 1506 to be an unacceptable 
best match and the method ends at that point with a conclusion in this case that no 
acceptable best match was found. 

[0088] Although file various aspects of the present invention have been 
described with respect to a preferred embodiment, it will be und^tood that the 
invention is entitled to full protection within the fiiU scope of the sqppended 
claims. 
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CLAIMS 

1 . A method for matching an input audio file with a plurality 
of refo*ence audio files, comprising: 

identifying potential matches of an input audio file among a 
plurality of reference audio files based upon at least one common characteristic; 

and 

searching for a match of the input audio file among the pot^otial 

matches. 

2. The method according to claim 1» wheiein the 
identification of the potential matches conq>rises identifying potential matches of 
the input audio file among the plurality of reference audio files by associating 

r ?*^:^rse binary tree with the input audio file in a same manner used to. 
associate noaes of the sparse binary tree with the plurality of reference audio 
files. 

3. The method according to claim 1, wherein the 
identification of potential matches comprises: 

generating an input profile fix>m the input audio file based upon a 
measurable attribute also used to generate a plurality of reference profiles firom 
the plurality of reference audio files; 

identifying potential matches among the plurality of reference 
profiles with the input profile by processing the input profile in a manner used to 
associate the plurality of reference profiles with nodes of a sparse binary tree. 

4. Hie method according to claim 3, wherein the generation 
of the input profile conqmses: 

segmenting the input audio file into chunks representing a period 

of time; and 
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counting a number of occurrences of a measurable attribute of the 
input audio file in individual of the chunks. 

5. The method according to claim 4, wherein individual of the 
chunks includes information of digitized samples of an audio clip over the period 
of time, and tiie measurable attribute for individual of the chunks is a number of 
zero crossings of the digitized samples in the chunk. 

6. The method according to claim 4, wherein individual of the 
plurality of reference profiles has been generated &om a corresponding one of the 
plurality of reference audio files by segmenting the refiorence audio file into 
chunks representing the period of time, and counting a number of occurrences of 
the measurable attribute of the reference audio file in individual of the chunks. 

7. The method according to claim 6, wherein the plurality of 
reference profiles are associated with nodes of the sparse binary tree by sampling 
chunks in individual of the reference profiles so that if a sampled chunk has a 
measurable attribute satisfying a criterion, thai a profile hook including an 
identification of the reference profile is associated with a node in the sparse 
binary tree. 

8. The method according to claim 7, wherein individual of the 
chunks of the input audio file includes information of digitized samples of an 
audio clip over their corresponding period of time, and the measurable attribute 
for individual of the chunks is a number of zero crossings of ttie digitized sample 
in the chunk. 

9. The method according to claim 8, wherein the criterion is 
satisfied if the zero crossing count of a sanq)led chunk is a local maximum. 

1 0. The method according to claun 7, wherem the profile hook 
is associated with a node in die sparse binary tree by storing the profile hook at a 
cuirent node resulting firom moving down left or ri^ branch child nodes starting 
with a root node of &e sparse binary tree depending \xpon wheQier a current 
chunk has a measured attribute value greato* flian a specified value starting with 
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the sampled chunk having the measurable attribute that satisfies the criterion and 
ending with a last chunk of flie reference profile. 

1 1 . The method according to claim 7, wherein the sampling of 
chunks in individual of the reference profiles is performed on an ever increasing 
sampling basis. 

12. The method according to claim 11, wherein the ever 
increasing sampling basis is a quadratically increasing sample basis. 

13. ' The method according to claim 1 1, wherein the ever 
increasing saicpling basis is an exponentially increasing sample basis. 

14. The method according to claim 1 1, wherein the sampling 
of chunks in individual of the refimnce profiles is performed on an ever 
increastag sampling basis starting with the sampled chunk having the measurable 
attribute that satisfies the criterion and ending with the last chunk of the reference 
profile by incrementing through the chunks at a specified velocity and 
acceleratiorL 

15. The method according to claim 14, wherein the velocity is 
the number of chunks between local maxima in the reference profile, and the 
acceleration is the change in velocity divided by the number of chunks over 
which the change occurs. 

16. The method according to claim 10, wherein the 
idmtification of potential matches among the plurality of reference profiles 
comprises identifying potential mini-matches by retrieving profile hooks 
associated with nodes in the sparse binary tree by identifying profile hooks 
associated with a current node resulting firom moving down left or rig^t branch 
child nodes starting with the root node of the sparse binary tree depending iqK)n 
wheUier a current chunk of the input profiUe has a measured attribute value greater 
than the specified value starting with the sampled chunk of fiie input profile 
having fiie measurable attribute fbst satisfies the critmon and ending with the last 
chunk of the iz^ut profile. 
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17. The method according to claim 16, wherein the 
identification of mini-matches among the plurality of reference profiles fiirther 
comprises for individual reference profiles corresponding to the retrieved profile 
hooks: 

comparing a number of chimks of the input profile and 
corresponding chunks of &e refimnce profile; and 

idmtifying a mini-match if corresponding chunks of the reference 
profile substantially matches those of the input proSle. 

18. The method according to claim 17, wherein the 
identification of mini-matches among the plurality of reference profiles fiirther 
conxprises: identifying a non-fiiU mini-match using a best matching one of the 
reference profiles with the input profile if none of flie reference profiles identified 
by the profile hooks substantially matches those of the input profile. 

19. The method according to claim 1 8, wherein the 
identification of potential matches fiirther comprises: merging any mini-matches 
and non-fiiU mini-matches corresponding to the same reference profile and 
having an ofiset into the input profile at which the reference profile begins within 
a specified tolerance. 

20. The method according to claim 19, wherein if a mini- 
match is merged with a non-fiiU mini-match, then the merged entity is referred to 
as a mini-match. 

21. The method according to claim 20, fiirther comprising: 
identifying the input audio file as a spoof if all mini-matches identified for the 
iiq>ut profile do not refer to the same reference profile. 

22. The method according to claim 21 , wherein fiie input audio 
file is not identified as a spoof if the sum of the total audio time covexed by the 
mini-matohes is less than a specified threshold value. 

23. The meOiod according to claim 22, wherein fiie ioput audio 
file is not identified as a spoof even if the sum of the total audio time covered by 
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the mini-matches is not less than the specified threshold value, if any of the mini- 
matches has an associated error per second value that is greater than a first 
specified maximum value. 

24. The method according to claim 25, wherein the searching 
for the match results in a best match being found if a percentage of the input 
profile and the reference profile covered by tiie best match exceeds some 
minimum value after ignoring all non-full mini-matches, ignoring mini-matches 
having an error per second value that is greater than a second specified maximum 
^-1.:-.- ^^f-^ '•nnsiderstip^ ^r^^'^ *~ ^^^^:^mm^ criteria. 

jn. 1^^-^^-^ -.u^ iAj^^.^iimg aii ii3pu-. audio file with a plurality 
of reference audio files, coraprising: 

identif>dT ^ * i1 matches of an input audio file among a 
plurality of reference audio files based upon at least one common charactmstic; 
and 

.:omparing an input profile resulting fit>m a measurable attribute of 
the input audio file with reference profiles resulting from the same measurable 
attribute of the potential matches to determine a match. 

26. The method according to claim 25, further comprising 
generating the input profile by: 

segmenting the iq)ut audio file into chunks representing a period 

of time; and 

counting a number of occurrences of a measurable attribute of the 
input audio file in individual of the chunks. 

27. The method according to claim 26, wherdn individual of 
flie chunks includes information of digitized samples of an audio clip over the 
period of time, and the measurable attribute for individual of the chunks is a 
number of zero crossings of Uie digitized sanQg)les in the chunk. 

28. The method according to claim 26, wherem individual of 
the pluraUty of reference profiles has been generated fix)m a corresponding one of 
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the plurality of reference audio files by segmenting the reference audio file into 
chunks representing the period of time, and counting a number of occurrences of 
the measurable attribute of the reference audio file in individual of the chunks. 

29. The method according to claim 28, wherein the plxirality of 
reference profiles are associated with nodes of the sparse binary tree by sampling 
chunks in individual of the reference profiles so that if a sampled chunk has a 
measurable attribute satisfying a criterion, then a profile hook including an 
identification of the reference profile is associated with a node in the sparse 

JU. jLut; mettiocl accoi^g to claim 29, wherein individual of 
the chunks of the input audio file includes information of digitized samples of an 
audio clip over their coiresponding period of time, and the measurable attribute 
for individual of tibie chunks is a number of zero crossings of the digitized samples 
in the chunk. 

3 1 . The method accoidmg to claim 30, wherein the raiterion is 
c5c^^....v^ Uie zero crossing count of a sampled chunk is a local maximum. 

32. The method according to claim 29, wherein the profile 
hook is associated with a node in the sparse binary tree by storing the profile 
hook at a current node resulting &om moving down left or right branch child 
nodes starting with a root node of the sparse binary tree depending upon whether 
a current chunk has a measured attribute value greater than a specified value 
starting with the sampled chunk having the measurable attribute that satisfies the 
criterion and ending with a last chunk of the reference profile. 

33. The method according to claim 29, wherein the sanq>Iing 
of chunks in individual of the reference profiles is performed on an ever 
increasing sanq)ljng basis. 

34. The meOiod according to claim 32, wherein the 
identification of potential matches among the plurality of reference profiles 
comprises identifying i>otential mini-matches by retrieving profile hooks 
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associated with nodes in the sparse binary tree by identifying profile hooks 
associated with a current node resulting firom moving down left or right branch 
child nodes starting with the root node of the sparse binary tree dependmg upon 
whether a current chunk of the ii^ut profile has a measured attribute value greater 
than the specified value starting with the sampled chunk of ttie input profile 
having the measurable attribute that satisfies the criterion and ending with the last 
chunk of the input profile. 

35. The method according to claim 34, wherein the comparison 
' ^''^^ the ref<a:ience profiles of the potential matches comprises 

for mdividual reference profiles corresponding to the retrieved profile hooks: 
comparing a number of chunks of the input profile and 

coiresponding chunks of the reference profile; and 

identifying a mini-match if corresponding chunks of the reference 

profile substantially matches those of fbc input profile. 

36. The method according to claim 35, wherein the comparison 
of the input profile with the reference profiles of the potential matches fiirther 
comprises: identifying a non-fiill mini-match using a best matching one of the 
reference profiles with the input profile if none of the reference profiles identified 
by the profile hooks substantially matches those of the input profile. 

■ - 37 . The method according to claim 36, wherein the comparison 

of the mpnt profile with the reference profiles of the potential matches further 
comprises: meipng any mini-matches and non-fiill mini-matches corresponding 
to the same reference profile and having an of&et into the input profile at which 
the reference profile begins wiflun a specified tolerance. 

38. The method according to claim 37, wherein if a mini- 
match is merged with a non-fiill mini-match^ then the merged entity is referred to 
as a mini-match. 
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39. The method according to claim 38, further comprising: 
identifying the input audio file as a spoof if all mini-matches identified for the 
input profile do not refer to the same reference profile. 

40. The method according to claim 38, wherein the comparison 
of the mpat profile with the reference profiles of the potential matches results in a 
best match being found if a percentage of the inpnt profile and the reference 
profile covered by the best match exceeds some minimum value after ignoring all 
non-fiiU mini-matches, ignoring mini-matches having an em>r p^ second value 

* — ^r-nrn value, and taking into 

consideration aiiy ovuc;i pto^aUMmj^ tji^i.wi&. 

41 . A method for matching an input file with a plurality of 
reference files, comprising: 

identifying potential matches of an input file among a plurality of 
reference files by associating nodes of a sparse binary tree with the input file in a 

^ ^'«c-d to associate nodes of the sparse binary tree with the plurality 

Ox Acierencc xties; and 

searching for a match of the input file among the potential 

matches. 

42. The method according to claim 4 1 , wherein the input file 
and the reference files include digitized san^les of time varying signals. 

43 . The method according to claim 42, wherein the 
identification of potential matches comprises: 

generating an input profile fix>m the input file based upon a 
measurable attribute also used to generate a plurality of reference profiles fixmi 
the plurality of reference files; 

identifying potential matches among the plurality of reference 
profiles with the input profile by processing the input profile in a manner used to 
associate the plurality of refermce profiles with nodes of tiie sparse binary tree. 
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44. The method according to claim 43, wherein the generation 
of the input profile comprises: 

segmenting the input file into chunks representing a period of 

time; and 

counting a number of occurrences of a measurable attribute of the 
input file in individual of the chunks. 

45. The mefiiod according to claim 44, wherem individual of 
the chunks includes iaformation of digitized samples of an audio clip over the 

.d of time, and the measurable attribute for individual of the chunks is a 
number of zero crossings of the digitized samples in flie chunk. 

46. The method ttOv;ording to claim 45, wherein individual of 
the plurality of reference profiles has been generated fix»m a corresponding one of - 
the plurality of reference files by segmenting the reference file into chunks 
rqsresenting the period of time, and counting a niunber of occurrences of flie 
measurable attribute of tiie reference file in individual of the chunks. 

47. The method according to claim 46, wherein the plurality of 
reference profiles are associated with nodes of the sparse binary tree by sampling 
chunks in individual of the reference profiles so that if a sampled chimk has a 
measurable attribute satisfying a criterion, then a profile hook including an 
identification ofthe reference profile is associated with a node in the sparse 
binary tree. 

48. The method according to claim 47, wherein individual of 
the chunks ofthe input file includes information of digitized samples of an audio 
clip over their corresponding period of time, and the measurable attribute for 
individual of the chunks is a number of zero crossings of the digitized samples in 
the chunk. 

49. The method according to claim 47, wherein the profile 
hook is associated with a node in the sparse binary free by storing the profile 
hook at a current node resulting &om moving down left or right branch child 
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nodes starting with a root node of the sparse binary tree depending upon whether 
a current chunk has a measured attribute value greater than a specified value 
starting with the sampled chunk having the measurable attribute that satisfies the 
criterion and ending with a last chunk of the reference profile. 

50. The method according to claim 47, wherein the sampling 
of chunks in individual of the reference profiles is performed ^" an ever 
increasing sampling basis. 

51. ^ .^t' .^,whs 3 
.aalm?*'^he3ai^''~ ,thc3-""''u-a''*''of ' .*;i:^ncepr- files 

compnses iaenti^dngpote-.tL£ -"^m-mztzn - ictx.^ .ing -^r-^'^.^ooks 
associated wiib. nodes in the Sj^. *b>xw. '^'^y- vhooks 

associated -^t^ngln^. ^Oi.. -sr right branch 

r — ijjg j^Q^ jjQ^ of the sparse binary tree depending upon 
whether a current <^hur^; of the input profile has a measured attribute value greater 
than the specified value starting with the sampled chunk of the input profile 
having the measurable attribute that satisfies the criterion and ending with the last 
chunk of the input profile. 

52. The method according to claim 5 1 , wherein the 
identification of mim'-matches among the plurality of reference profiles fiirther 
comprises for individual reference profiles corresponding to the retrieved profile 
hooks: 

comparing a number of chunks of the mpui profile and 
corresponding chunks of the reference profile; and 

identifying a mini-match if coiresponding chunks of the reference 
profile substantially matches fliose of the isput profile. 

53. The method according to claim 52, wherein the 
identification of mini-matches among the plurality of reference profiles fintiier 
conqirises: identifying a non-fuUxnini-match using a best niatching one of the 
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reference profiles with the input profile if none of the reference profiles identified 
by the profile hooks substantially matches those of the input profile. 

54. The method according to claim 53, wherein the 
identification of potential matches fiirther comprises: mer^g any mini-matches 
and non-fiiU mini-matches corresponding to the same reference profile and 
having an offset into the input profile at which the reference profile begins within 
a specified tolerance. 

55. The method accordmg to claim 54, wherein if a mini- 

_ . r- -^^^^^^1 ^ ^.11 -.J-,: — fv,^ merged entity is referred to 
as amim-iuaicn. 

56. The met'uou .according to claim 55, fiirther comprising: 
identifying th? - -r-nf if all mini-matches identified for the input ; 
profile do not iw... reference profile. 

57. The method according to claim 55, wherein the searching 
^^-^^^h results in a best match being found if a percentage of the input 

profile and tae icxCiience profile covered by the best match exceeds some 
minimum value after ignoring all non-fixll mini-matches, ignoring mini-matches 
having an error per second value that is greater than a second specified maximum 
value, and taking into consideration any other programmed criteria. 

. 58: A method for matching an input file with a plurality of 

reference files, comprising: 

identifying potential matches of an input file among a plurality of 
reference files by associating nodes of a sparse binary tree with the iiq)ut file in a 
same manner used to associate nodes of the sparse binary tree with the plurality 
of reference files; and 

coixQ)aiing a profile resulting &om a measurable attribute of the 
iiq)ut file with profiles resulting fix>m ttie same measurable attribute of the 
potential matches to determine a match. 
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59. The method according to claim 1 c, wherein the input file 
and the reference files include digitized samples of time varying signals. 

60. The method according to claim 59, fiirth^ comprising 
generating the input profile by: 

segmenting the input file into chunks representing a period of 

time; and 

counting a number of occurrences of a measurable attribute of the 
inpi-t ' " jA of the chunks. 

. T ^ niethod according to cla^m 60. wherein individual of 

thechuiiksiriCla*^*^^ infonriationof . ^ . or&e 

period ' . _ surable attr" , for b.dr ... .c .anks is a 

wxml of zero crossings of the Otg ' ' ^^es in the chunk. 

62. The method at -"timfe.. \^^erein individual of 
the plu. .v'y " " isrence profiles has beei. ^eneratc.l T c cozresponding one of 
lu^ ^iu££Laiy of reference files by segmenting the reference file into chunks 
representing the period of time, and counting a nmnber of occurrences of the 
measurable attribute of the reference file in individual of the chunks. 

63. The method according to claim 62, wherein the plurality of 
reference profiles are associated with nodes of the sparse binary tree by sampling 
chunks in individual of the reference profiles so that if a sampled chunk has a 
measurable attribute satisfying a criterion, then a profile hook including an 
identification of the reference profile is associated with a node in the sparse 
binary tree. 

64. The mefiiod according to claim 63, wherein individual of 
the chunks of the input file includes information of digitized samples of an audio 
clip over their corresponding period of time, and the measurable attribute for 
individual of tihe chunks is a number of zero crossings of the digitized 5an9)les in 
file chunk. 



30 



wo 2005/109179 



PCT/US2005/015643 



65. The method according to claim 64, wherein the criterion is 
satisfied if the zero crossing count of a sampled chunk is a local naaximum. 

66. The method according to claim 63, wherein the profile 
hook is associated with a node in the sparse binary tree by storing the profile 
hook at a current node resulting fix>m moving down left or right branch child 
nodes starting with a root node of the sparse binary tree depending upon whether 
a current chunk has a measured attribute value greater than a specified value 
starting with the sampled chunk having the measurable attribute that satisfies the 

• 1 - . . ,1. . 1 of flje reference profile. 

67 . rue meiaod according to claim 63, wherein the sampling 
of chunks in individual of the reference profiles is performed on an ever 
increasing sampling basis. 

68. The method according to claim 66, wherein the 
ide^!^*5catjon of potential matches among the plurality of reference profiles 
comprises identifying potential mini-matches by retrieving profile hooks 
associated with nodes in the sparse binary tree by identifying profile hooks 
associated with a current node resulting firom moving down left or right branch 
child nodes starting with the root node of the sparse binary tree depending upon 
whether a current chunk of the input profile has a measured attribute value greater 

- than the specified value starting with the sampled chunk of the input profile 
having the measurable attribute that satisfies the criterion and ending with the last 
chunk of the input profile. 

69. The method according to claim 68, wherein &e comparison 
of the input profile with the reference profiles of the potential matches comprises 
for individual reference profiles corresponding to the retrieved profile hooks: 

con^aring a number of chunks of the ii^ut profile and 
conespondihg chunks of the reference profile; and 

identifying a nuni-match if corresponding chunks of the reference 
profile substantially matches those of the input profile. 
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70. The method according to claim 69, wherein the comparison 
of tiie input profile with the reference profiles of the potential matches further 
comprises: merging any mini-matches corresponding to the same reference 
profile and having an offset into the input profile at which the reference profile , 
begins within a specified tolerance. 

7 1 . The method according to claim 70, wherein the comparison 
of the input profile with the reference profiles of the potential matches results in a 
best match being found if a percentage of the input profile and the reference 

^ 1 ^t,a» t*i^tr^ pr?p»M5; some TnitiiTmim value after ignoring 

value that is greater than a specified 
maximum value, and taking into consideration any programmed criteria. 

72. A method for matching an iiq>ut audio file with a plurality 
of reference audio files, comprising: 

generating an input profile fiom an input audio file based upon a 
measurable attribute also used to generate a plurality of reference profiles &om a 
plurality of reference audio files; 

identifying potential matches among the pluraUty of reference 
profiles with the input profile by processing the input profile in a manner used to 
associate individual of the pliurality of reference profiles with nodes of a sparse 
binary tree; and 

comparing the input profile with the potential matches to 
detemoine a match. 

73. The method according to claim 72, wherein the generation 
of fht input profile comprises: 

segmenting the inpvA audio file into chunks representing a period 

of time; and 

counting a number of occurrences of a measurable attribute of the 
input audio file in individual of the diunks. 
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74. The method according to claim 73, wherein individual of 
the chunks includes information of digitized samples of an audio clip over the 
period of time, and the measurable attribute for individual of the chunks is a 
number of zero crossings of the digitized samples in the chimk. 

75. The method according to claim 73, wherein individual of 
the plurality of reference profiles has been generated fiom a corresponding one of 
the plurality of reference audio files by segmenting the reference audio file into 
chunks representing the period of time, and counting a number of occurrences of 
fi>e nv^--^— '^^'bute of Che reference audio file in individual of the chunks. 

76. The method according to claim 75, wherein the plurality of 
reference profiles are associated with nodes of the sparse binary tree by sampling 
chunks in individual of the reference profiles so that if a sampled chunk has a 
measurable attribute satisfying a criterion, then a profile hook including an 
identification of the reference profile is associated wi& .a node in the sparse 
binary tree. 

77. The method according to claim 76, wherein individual of 
the chunks of the input audio file includes information of digitized samples of an 
audio clip over their corresponding period of time, and the measurable attribute 
for individual of the chunks is a number of zero crossings of the digitized samples 
in the chunk. - 

78. The method according to claim 76, wherein the profile 
hook is associated with a node in the sparse binary tree by storing the profile 
hook at a current node resulting fiom moving down left or right branch child 
nodes starting with a root node of &e q>arse binary tree depending upon whether 
a cuirent chunk has a measured attribute value greater than a specified value 
starting with the sanq)led chunk having the measurable attribute that satisfies tihe 
criterion and ending with a last chunk of tfie reference profile. 

79. The method according to claim 78, wh^dn the 
identification of potential matches among the plurality of reference profiles 
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comprises identifying potential mini-matches by retrieving profile hooks 
associated with nodes in the sparse binary tree by identibEying profile hooks 
associated with a cuirent node resulting from moving down left or right branch 
child nodes starting with the root node of the sparse binary tree depending upon 
whether a current chunk of the input profile has a measured attribute value greater 
than the specified value starting with the sampled chunk of the input profile 
having the measurable attribute that satisfies the criterion and ending with the last 
chunk offlieiiq)ut profile. ~ . 

80. The method according to claim 79, wherein the 
identification of mini-matches among fhe plurality of reference profiles further 
comprises for individual reference profiles corresponding to the retrieved profile 
hooks: 

comparing a number of chunks of the input profile and 
coiresponding chunks of the reference profile; and 

identifying a mini-match if corresponding chunks of the reference 
profile substantially matches those of the input profile, 

8 1 . The method according to claim 79, wherein the 
identification of potential matches further comprises: merging any mini-matches 
coiresponding to the same referrace profile and having an of&et into the input 
profile at which the reference profile begins within a specified tolerance. 

82. The method according to claim 81, further comprising: 
identifying the input audio file as a spoof if all mini-matches identified for the 
mput profile do not rofer to the same reference profile. 

83. The metiiod according to claim 81, wherein fhe searching 
for fhe match results in a best match bemg found if a percentage of tiie input 
profile and the reference profile covered by the best match exceeds some 
minimum value after ignoring mini-matches having an error per second value that 
is greater than a specified maximum value, and taking into consideration any 
programmed criteria. 
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84. A method for matching an input audio file with a plurality 
of reference audio files, comprising: 

generating a plurality of reference profiles firom a plurality of 
reference audio files using a measurable attribute; 

generating a sparse binary tree by applying a process to the 
plurality of reference profiles such that identifications of the plurality of reference 
profiles are associated at resulting nodes of the sparse binary tree; 

generating an input rr' r ^om the input audio file usmg the 

applymg ths^ . ^^ess to the Input profile so that associated 
reference profiles are identiSed from rssuidng nodes of the sparse binary tree; 
and 

con-v .uig at least a portion of the iiq>ut profile with 
'•r^^esponding pordoiii; cit: r identified reference profiles to determine a match. 

- according to claim 84, wherein the generation 
of the plurality of refex ^ .Zlzs comprises: 

segmenting individual of the plurality of reference audio files into 
chunks individually representing a period of time; and 

counting a number of occurrences of the measurable attribute in 
^ individual of the chunks of the plinality of reference audio files. 

86. The method according to claim 85, wherein individual of 
the chunks of the reference audio files includes information of digitized samples 
of an audio clip over the period of time, and the measurable attribute for 
individual of the chunks is a number of zero crossings of the digitized samples in 
the chunk. 

87. The method according to claim 84, wherein the generation 
of the sparse binary tree comprises: sampling chunks in individual of the plurality 
of reference profiles so ibat if a sampled chunk has a measurable attribute 
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satisfying a criterion, then a profile hook including an identification of the 
reference profile is associated with a node in the sparse binary tree. 

88. The method according to claim 84, wherein the generation 
of the input profile comprises: 

segmenting the input audio file into chunks representing a period 

of time; and 

counting a number of occurrences of a measurable attribute of fhe 
input audio file in individual of the chunks. 

89. The method according to claim 88, whira^ individual of 
the chunks of the ioput audio file includes information of digitized samples of an 
audio clip over tiieir corresponding period of time, and tiie measurable attribute 
for individual of the chimks is a number of zero crossings of the digitized samples 
in the chunk. 

90. The method according to claim 89, wherein the criterion is 
satisfied if the zero crossing count of a sampled chunk is a local maximum. 

91 . The metiiod according to claim 89, wherein the profile 
hook is associated with a node in the sparse binary tree by storing the profile 
hook at a current node resulting fix>m moving down left or rigjit branch child 
nodes starting with a root node of the sparse binary tree depending upon whether 
a current chunk has a measured attribute value greater than a specified value 
starting with the sampled chunk having the measurable attribute that satisfies the 
criterion and ending with a last chunk of the reference profile. 

. 92. The method according to claim 84, wherein the application 
of the process to the input file comprises: identifying potential nuni-matches by 
retrieving profile hooks associated v^th nodes in fhe sparse binary tree by 
identifying profile hooks associated with a current node resulting firom moving 
down left or ri^t branch child nodes starting with the root node of the sparse 
binary tree dqpending i^on whetiier a cuirent chimk of the input profile has a 
measured attribute value greater than the specified value starting with tfie 
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sampled chunk of the input projBle having the measurable attribute that satisfies 
the criterion and mding with the last chunk of the input profile. 

93. The method according to claim 84, wherein the comparison 
of the input profiile with the identified reference profiles comprises: 

comparing a number of chunks of the input profile and 
corresponding chunks of the reference profiles; and 

idmtifying a mini-match if any corresponding chunks of the 
reference profiles substantially matches those of the input profile. 

— ,1, , ^ -^-^iag to claim 93, whwein the conoparison 

oi ine mpui proiiie with the identified refermce profiles fiirther comprises: 
merging any mini-matches corresponding to the same reference profile and 
havmg an offeet into the input profile at which the reference profile begfais within 
a specified tolerance. 

95. The method according to claim 94, fiirther comprising: 
identifying the input audio file as a spoof if all mini-matches identified for the 
input profile do not refer to the same reference profile. 

96. The method according to claim 95, wherein the comparison 
of the input profile with the identified reference profiles results in a best match 
being found if a percentage of the input profile and the reference profile 
corresponding to tiie best match exceeds some mIniTniim value after ignoring 
mini-matches having an error per second value that is greater than a specified 
maximum value, and taking into consideration any programmed criteria. 

97. An apparatus for matching an input audio file with a 
plurality of reference audio files, con^)rising at least one compute configured to: 

identify potential matches of an input audio file among a plurality 
of refermce audio files based upon at least one common characteristic; and 
search for a match of the input audio file among the potential 

matches. 
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98. The apparatus according to claim 97, wherein the at least 
one computer is further configured to identify potential matches of the input 
audio file among the plurality of reference audio files by associating nodes of a 
sparse binary tree with the input audio file in a same manner used to associate 
nodes of the sparse binary tree with the plurality of reference audio files. 

99. An ^aratus for matching an input audio file with a 
plurality of reference audio files, comprising at least one computer configured to: 

identify potential matches of an input audio file among a plurality 
j:^ based uoon at least one common characteristic; and 
coHip^-w cjj. ijjiyui profile resulting fix>m a measurable attribute of 

flie input audio file with reference profiles resulting &om the same measurable 

attribute of the potential matches to determine a match. 

100. The apparatus according to claim 99, wherein the at least 
one computer is further configured to generate the input profile by: 

segmenting the input audio file into chunks representing a period 

of time; and 

counting a niunber of occurrences of a measurable attribute of the 
input audio file in individual of the chunks. 

101. The ^paratus according to claim 1 00, wherein individual 
of the plurality of reference profiles has been generated fi-om a corresponding one 
of Ihe plurality of reference audio files by segmenting the reference audio file into 
chunks representing the period of time, and coxmting a number of occurrences of 
the measurable attribute of the reference audio file in iudividual of the chunks. 

102. The apparatus according to claim 101, wherein the 
plurality of reference profiles are associated with nodes of a sparse binary tree by 
sampling chunks in individual of fixe reference profiles so that if a sampled chunk 
has a measurable attribute satisfying a criterion, then a profile hook including an 
identification of the reference profile is associated with a node in the sparse 
binary tree. 
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103. The apparatus according to claim 102, wherein the profile 
hook is associated with a node in the sparse binary tree by storing the profile 
hook at a current node resulting from moving down left or right branch child 
nodes starting with a root node of the sparse binaiy tree depending upon whether 
a current chunk has a measured attribute value greater than a specified value 
starting with the sampled chunk having the measurable attribute fhat satisfies the 
criterion and ending with a last chunk of the reference profile. 

104. The q)paratus according to claim 103, wherem the at least 
one rrmx^nter is further configured to identify potential mini-matches by 
retrieving profile hooks associated with nodes in the sparse binary tree by 
identifying profile hooks associated with a current node resulting &om moving 
down left or rigiht branch child nodes starting with the root node of fiie sparse 
binary tree dependiug upon whether a current chunk of the input profile has a 
measured attribute value greats than the specified value starting with the 
sanqpled chunk of the input profile having the measurable attribute that satisfies 
the critmon and endiag with the last chunk of the input profile. 

105. The apparatus according to claim 104, wherein the at least 
one computer is further configured to merge any mini-matches corresponding to 
the same reference profile and having an offset into the iiiput profile at which the 

' " reference profile begins within a specified tolerance. ^ 

106. The ^paratus according to claim 105, wherein the at least 
one compute is further configured to identify the input audio file as a spoof if all 
rmni-matches identified for the mput profile do not refer to the same reference 
profile. 

107. The s^aratus according to claim 106, wherera the at least 
one coiiq)uter is further configured to not identify the input audio file as a spoof if 
the sum of the total audio time covered by the mini-matches is less than a 
specified threshold value. 
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108. The apparatus according to claim 107, wherein the at least 
one computer is further configured to not identify the input audio file as a spoof 
even if the sum of the total audio time covered by the mini-matches is not less 
than the specified threshold value, if any of the mini-matches has an associated 
error per second value that is greater than a first specified maximum value. 

109. An apparatus for matching an input file with a plurality of 
reference files, comprising at least one computer configured to: 

~ identify potential matches of an input file among a plurality of 
reference files by associating nodes of a sparse binary tree with the input fiOie in a 
same manner used ta associate nodes of the sparse binary tree with the plurality 
of reference files; and 

search for a match of the input file among the potential matches. 

110. The apparatus according to claim 109, wherein the input 
file and the reference files include digitized samples of time varying signals. 

111. An apparatus for matching an input file with a plurality of 
reference files, comprising at least one computer configured to: 

identify potential matches of an input file among a plurality of 
reference files by associating nodes of a sparse binary tree with the input file in a 
same manner used to associate nodes of the sparse binary tree with the plurality 
- of reference files; and ^ - 

compare a profile resulting &om sl measurable attribute of the 
input file with profiles resulting &om the same measurable attribute of the 
potmtial matches to determine a match. 

112. The apparatus according to claim 111, wherein the ii^ut 
file and the referrace files include digitized sanq>les of time varying signals. 

113. An apparatus for matching an ixiput audio file with a 
plurality of reference audio files, comprising at least one conq)uter configured to: 
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generate an input profile from an input audio file based upon a 
measurable attribute also used to generate a plurality of reference profiles from a 
plurality of reference audio files; 

identify potential matches among the plurality of reference profiles 
with the input profile by processing the input profile in a manner used to 
associate individual of the plurality of reference profiles with nodes of a sparse 
binary tres; J 

-.i'jiit profile with.tfae potmtial m. ; • ' ^ to ?^ftermine 

i i4. AT f rirsaratUi for matching an input aaJi:: ile with a 
piuralir , nrising at least one computer configured to: 

genr-' ■ i;y of reteiv/-*^ t-^-^fiipc -f^^,^ _ ^ ^ 

refererjot; u^^jr ^U-,s usmg a measurable attribute; 

generate a sparse binary tree by applying a pxx>ces5 to uie plurality 
of referrace profiles such that identifications of the plurality of reference profiles 
are associated at resulting nodes of the sparse binary tree; 

generate an input profile fix>m the input audio file using the 
measurable attribute; 

apply the process to the input profile so that associated reference 
' profiles are identified from resulting nodes of the sparse binary tree; and 

compare at least a portion of the input profile with coxresponding 
portions of the identified reference profiles to determine a match. 
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